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Abstract 

Background: For a two-breed crossbreeding system, Wei and van der Werf presented a model for genetic evaluation 
using information from both purebred and crossbred animals. The model provides breeding values for both purebred 
and crossbred performances. Genomic evaluation incorporates marker genotypes into a genetic evaluation system. 
Among popular methods are the so-called single-step methods, in which marker genotypes are incorporated into a 
traditional animal model by using a combined relationship matrix that extends the marker-based relationship matrix 
to non-genotyped animals. However, a single-step method for genomic evaluation of both purebred and crossbred 
performances has not been developed yet. 

Results: An extension of the Wei and van der Werf model that incorporates genomic information is presented. The 
extension consists of four steps: (1) the Wei van der Werf model is reformulated using two partial relationship matrices 
for the two breeds; (2) marker-based partial relationship matrices are constructed; (3) marker-based partial relationship 
matrices are adjusted to be compatible to pedigree-based partial relationship matrices and (4) combined partial 
relationship matrices are constructed using information from both pedigree and marker genotypes. The extension of 
the Wei van der Werf model can be implemented using software that allows inverse covariance matrices in sparse 
format as input. 

Conclusions: A method for genomic evaluation of both purebred and crossbred performances was developed for a 
two-breed crossbreeding system. The method allows information from crossbred animals to be incorporated in a 
coherent manner for such crossbreeding systems. 



Background 

Production systems based on crossbreeding are predomi- 
nant in pig and chicken breeding and take advantage of the 
increased performance of crossbred animals compared to 
purebred animals. For a two-breed crossbreeding system, 
Wei and van der Werf (Appendix 2 in [1]) presented a 
model for genetic evaluation using information from both 
purebred and crossbred animals. The model provides esti- 
mated breeding values for purebred (mating with own 
breed) and crossbred (mating with the other breed) per- 
formances that are different but correlated. The model is 
particularly attractive since it can fit a breeding goal that 
includes both purebred and crossbred performances (see 
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Jiang and Groen [2]). This model is the starting point of 
our paper. 

Genomic selection [3] has offered a new paradigm for 
livestock breeding and has been successfully applied for 
selection within purebred populations [4-6]. Moreover, 
genomic selection also offers greater opportunities for 
incorporating information from crossbreds and selecting 
for crossbred performance [7-9]. Genomic selection of 
purebreds for crossbred performance was proposed by 
Ibanez-Escriche et al. [7] that used phenotypes on cross- 
breds only, and a genomic model with breed of origin 
specific allele substitution effects. The resulting breeding 
values for purebred animals were for crossbred perfor- 
mance. Although the study included genomic data, it 
was less sophisticated than the Wei and van der Werf 
model [1] since each animal had only one breeding value 
and phenotype recordings in purebreds were not used. 
In addition, it assumed that all relevant animals were 
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genotyped, which would not be a very likely scenario in 
practice. 

In cases in which not all animals are genotyped, the 
so-called single-step methods [10-12] provide a coherent 
approach for genomic evaluation. These methods incor- 
porate marker genotypes into a traditional animal model 
[13] by using a combined relationship matrix that extends 
the marker-based relationship matrix of VanRaden [14] to 
non-genotyped animals, and they have been shown to per- 
form well for genomic evaluation of dairy cattle [11,15], 
pigs [16,17] and chickens [18]. Misztal et al. [19] provided 
an extension with "unknown-parent groups" to allow for 
different populations, but using such an approach on data 
from both purebred and crossbred animals would assume 
equal genetic variances in the two breeds and in the 
crossbreds, and also that breeding values for purebred 
and crossbred performances are the same. A single-step 
method for genomic evaluation of both purebred and 
crossbred performances has not been developed yet. 

In a genomic model, when crossbred animals are geno- 
typed, it is natural to split the additive genetic effect 
of crossbreds into breed of origin specific genetic com- 
ponents, as in Ibanez-Escriche et al. [7]. Each of these 
components is a partial genetic effect, in the sense that 
only breed-specific alleles are used. This use of the ter- 
minology "partial genetic effect" is consistent with the 
model of Garcia-Cortes and Toro [20], in which for multi- 
breed analysis the additive genetic value is split into sev- 
eral independent parts depending on their genetic origin, 
with the variance-covariance structure of each part being 
determined by a partial relationship matrix (constructed 
from pedigree). A partial relationship matrix is a relation- 
ship matrix that describes relationships only according to 
genetic origin. From this point of view, partial relationship 
matrices are key when constructing a single-step method 
for both purebred and crossbred performances. However, 
the Wei and van der Werf model is not formulated using 
partial relationship matrices, and it therefore needs to be 
reformulated for the purpose of incorporating genomic 
information. 

The aim of this paper is to present an extension of the 
Wei van der Werf model that incorporates genomic infor- 
mation. The extension consists of four steps: (1) the Wei 
van der Werf model is reformulated using two partial 
relationship matrices [20] for the two breeds; (2) marker- 
based partial relationship matrices similar to VanRaden 
[14] are constructed; (3) marker-based partial relationship 
matrices are adjusted to be compatible to pedigree-based 
partial relationship matrices, similar to Christensen et al. 
[17] and (4) combined partial relationship matrices are 
constructed using information from both pedigree and 
marker genotypes, similar to the combined relationship 
matrix of Legarra et al, Aguilar et al. and Christensen and 
Lund [10-12]. This extension of the Wei and van der Werf 



model can be implemented using software that allows 
inverse covariance matrices in sparse format as input. 

Methods 

The Wei and van der Werf model 

Here, the Wei van der Werf model (Appendix 2 in [1]) is 
presented. The two breeds are named A and S, and it is 
assumed that all crossbred animals AB have known pure- 
bred parents. The number of animals in the pedigree is 
nj[ and ns for breed A and breed S, respectively, and the 
number of crossbred animals is nj^B- The model for the 
phenotypes is a trivariate model 



y^ = ^aPa + '^A^A + e^, 

YAB = ^AbPaB + ^AB + ^ABy 



(1) 



where the vectors y^, and contain phenotypes 
on the breed A breed B and crossbred AB animals, 
respectively, and for the three breed groups A B and 
ABy the vectors X^jS^, X^Ps ^AbPab contain 
fixed effects, and ~ 7V(0,cr^I), ~ N(0,a^l) 
and e^23 ~ A/^(0, cr^^I) are the residual error vectors. 
The «^ -dimensional vector contains breeding values 
for purebred performance for breed A animals (mat- 
ing within breed A), and matrix is an incidence 
matrix assigning breeding values to records. Vector as 
and matrix Zs are defined similarly for breed S. Finally, 
the -dimensional vector Cj{b contains the additive 
genetic effects for crossbred animals, and these are related 
to the vectors of breeding values for purebred animals for 
crossbred performance (mating with the other breed) as 
follows 



CAB = 0,5(Zab,ACA + ^AB,BCb) + ^AB^ 



(2) 



where the matrices Zj\^b,a ^ab,b assign purebred par- 
ents to crossbred offspring, is an «^ -dimensional vec- 
tor containing breeding values for crossbred performance 
for breed A animals (mating with breed B animals), cb 
is an «5 -dimensional vector containing breeding values 
for crossbred performance for breed B animals (mating 
with breed A animals), and the vector ^ab contains the 
Mendelian sampling effects. 
The genetic covariances are described by 



Var 
Var 



^A 
CA 



^B 
CB 

Var [^ab\ = ^AB. 



= 5:('^)(g)A^, 



and the three vectors are independent. The matrices 
and Ab are the additive relationship matrices for breed 
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A and breed S, respectively, 0 denotes the Kronecker 
product, and 



(A) ^(A) 
^11 ^12 

^21 ^22 



^11 
^21 



^12 
^22 



are the 2x2 variance-covariance matrices containing the 
genetic variances for purebred breeding values and cross- 
bred breeding values, and the covariance between the 
two, for breed A and breed S, respectively. The variance- 
covariance matrix of the Mendelian sampling term is a 
diagonal matrix D^B with elements 

. r ^ Var(c4/'(/))+Var(cHm(o) 
iDAB)ii = y^r(cABi) 



- (^22^\^^)/(0/(0 + ^22^(^5)m(0m(0)/4 



(3) 



= E^$^^(l/2-(A^V(0/(o/4) 
+ E^f (1/2 - (A5W')m(0/4), 

where for crossbred animal /, /(/) denotes the breed A 
parent and m(i) denotes the breed B parent. 

The Wei and van der Werf model is an additive genetic 
model in the sense that the breeding values for pure- 
bred performance, a^, a^, are additive genetic effects, and 
the breeding values for crossbred performance, c^, c/3, in 
combination with the genetic effects cab are also addi- 
tive genetic effects. The model therefore does not contain 
dominance genetic effects explicitly. In practice, such an 
additive genetic model may also partly capture dominant 
gene actions and other non-additive gene actions [21]. The 
fact that genetic correlations between purebred and cross- 
bred performances are different from one would be due 
to the presence of dominant gene actions in combination 
with different allele frequencies in the two breeds [22], 
in addition to genetic effects being different in different 
environments. In addition, the model captures the gen- 
eral level of heterosis in crossbred animals since it has a 
seperate fixed mean effect for crossbred animals. 

Wei and van der Werf [1] made an alternative formu- 
lation of the model. The term cab is not of interest for 
genetic evaluation when crossbred animals are not used 
for breeding, and Wei and van der Werf reformulated the 
model using €ab = ^ab-^ ^ab as the residual error term 
for the crossbred phenotypes and thereby expressed the 
model as a reduced model using only the terms a^, c^, a/3 
and cb with breeding values for purebred animals. Note 
that due to different levels of inbreeding of parents (see 
formula (3)), the term eAB has heterogeneous variance, 
and assuming a constant variance is an approximation. 
The reduced model can be implemented using software 



that handle multi- trait genetic models. For the purpose 
of this paper, observed marker genotypes on crossbred 
animals provide information on the Mendelian sampling 
term <^ab> and the absorption of this term into the resid- 
ual error term is therefore not well-suited. For this reason 
we do not follow the reduced model in this paper. 

Finally, the special case where ^^'^ = ^^'^ = ^22 can 
be formulated as 



Var 



^A 




* 




* 




* 






a^ 




0 E 


* 




_ ^21 ^ 


CA 




cb 




.^AB _ 





^21 



^12 
^12 
^22 



(g)A, 



where * denotes artificial random vectors such that 
the genetic variance-covariance matrix can be expressed 
using a Kronecker product, and A is the usual additive 
relationship matrix for all animals. This can therefore be 
implemented using a combined pedigree across all ani- 
mals. We will return to this special case in the Discussion 
section. 

Reformulated model 

Here, the Wei and van der Werf [1] model is reformulated 
using breed-specific partial relationship matrices, as in 
Garcia-Cortes and Toro [20]. Partial relationship matrices 
describe relationships according to genetic origin. 

The starting point of the reformulation is the Mendelian 
sampling term for the crossbred animals in formula 
(2). This term can be split into breed of origin effects. 



(B) 



^AB = + O^^, where and are 

independent. Formula (3) can be formulated in mat- 
rix notation as Dab = ^^^(^^ab^ ~^ ^^^(^ab^' where 



Var(<I>^^) 



(A) 



(0.51, 



0.25diag(Z^^,^A^Z^^^^)) 



and Var (^^^) = S^f (0.5I«^^ -0.25diag(Z^B,sA5ZT 
with IriAB being an identity matrix of size hab^ and 
diag(Z^23,^A^Z^^^^) and diag(Z^23,23A^Z^^^^) denot- 
ing diagonal matrices containing diagonal elements in 
matrices Z^23,^A^Z^^^^ and Z^^,^A23Z^^^^, respec- 
tively. In other words, we decompose the Mendelian sam- 
pling term into Mendelian sampling terms for the A and B 
gametes. The additive genetic effect for crossbred animals 
in formula (2) can then be expressed as 

(A) , (B) 



where c^^ = 0.5Z^^,^c^ + O^^, c^^ = 0.5Zab,bCb + 
^AB' ^AB ^AB independent, i.e. the genetic 
effects for crossbred animals is split into two breed of 



(A) (B) 
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origin genetic effects. Thus, the model equation system 
(1) can be written as: 

= XaPa + '^A^A + e^, 

YB = ^bPb + ^B^B + ^B> (4) 

V o ^ (A) , (B) , 
YAB = XabPaB + ^AB + ^AB + ^AB- 

Focusing on breed A the variance-covariance matrix of 
the genetic effects becomes 

Var(c^fj) = 025i:^^fZAB,AAAZ\s,A 

+ E^f (0.51^^^ - 0.25diag(Z^5,^A^Z^^^^)) 

— ^22 ^AB' 

where is a matrix with elements = 0.5 and 

(^AB^U' = 025(ZAB,AA^A^AB,A^i,i' ^^en / 7^ and the 
covariance matrix between and becomes 

Cov(c^f^,c^) = 0.5E^f Z^5,^A^. 

Therefore, the variance-covariance matrix of breed A 
specific genetic effects for crossbred performance equals 



Var 



CA 

JA) 
^AB 



where the symmetric {ha + ^^23) -dimensional matrix 



O^SZab,aAa 



AA) 
^AB 



is the breed A specific partial relationship matrix in 
Garcia-Cortes and Toro [20] (see below). 

Similarly, the variance-covariance matrix of breed B 
specific genetic effects for crossbred performance equals 



Var 



where 



CB 
^AB 



— ^22 ^ ' 



Kb 0.5AbZ\^^^ 



O.SZab.b^b 



^AB 



is the breed B specific partial relationship matrix (see 
below). 

Garcia-Cortes and Toro [20] presented a partition of 
the variance-covariance matrix of additive genetic values 
into breed-specific and breed-segregation terms, where 
each term is a scaled partial relationship matrix. The par- 
tial relationship matrices are constructed using recursive 
formulas similar to usual recursive formulas for the addi- 
tive relationship matrix [23] . For the two-breed terminal 
crossbreeding system, the partition results in breed A 
and breed B specific partial relationship matrices, but no 
breed-segregation partial relationship matrices; we refer 
to Garcia-Cortes and Toro [20] for the general case. The 



recursive formulas for the breed A specific partial rela- 
tionships are: 

a(^^ - fA _^_ AA) 1^ 



(A 



{A) 

'fay 



where/(/) and m{i) are the two parents of animal /, animal 
i' is not a descendant of zxvdfj^ is the breed A propor- 
tion of individual / (equal to 1 for purebred A animals, 
0 for purebred B animals and 0.5 for crossbred animals). 
To insure that partial relationship matrices are invertible, 
Munilla-Leguizamon and Cantet [24] suggested to rede- 
fine the partial relationship matrices such that only ele- 
ments that are non-null by breed origin were included, i.e. 
for the breed A specific partial relationships shown here, 
the elements related to purebred B animals are excluded. 
In this paper, we followed that suggestion, and it is not dif- 
ficult to check that the matrix in (5) is indeed the breed A 
specific partial relationship matrix. Using matrix formu- 
lation, the breed A specific partial relationship matrix is 
pJyA) _ XDT -"^ where D is a diagonal matrix with elements 
Da = 1 - (A^ffilf^.^ +^%m(i)y^ ^h^n ^ni^^l ^ is breed A 
and Da = 0.5 — when animal / is crossbred with 

breed A parent /(/)♦ For matrix T, the inverse matrix 
is a lower triangular matrix with diagonal elements equal 
to 1 and in the lower diagonal, the only non-zero elements 
are —0.5 for offspring parent elements. An example with a 
small pedigree is in Table 1, and the corresponding partial 
relationship matrices are in Tables 2 and 3. 

The reformulation of the Wei and van der Werf model 
is completed by introducing two artificial random vectors 
and such that genetic variance-covariance matri- 
ces can be presented using Kronecker products. For breed 
A the genetic covariances are described by 



Var 



^A 

^AB 

CA 
SA) 
^"-AB 



= E 



(A) 



(g)A^^), 



Table 1 Example pedigree 


Id Father 


Mother 


Breed group 


1 0 


0 


A 


2 0 


0 


A 


3 1 


2 


A 


4 0 


0 


B 


5 0 


0 


B 


6 4 


5 


B 


7 2 


6 


AB 


8 3 


6 


AB 
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Table 2 Breed A specific partial relationship matrix A^^^ 
for the pedigree in Table 1 

Id 1 2 3 7 £ 



1 

2 


1 

0 


1 


3 


1/2 


1/2 1 


7 


0 


1/2 1/4 1/2 


8 


1/4 


1/4 1/2 1/8 1/2 


and similary for breed S, the genetic covariances are 
described by 


Var 


^AB 

CB 

AB) 
^^AB J 




Implementing the model requires inverses of the two 



partial relationship matrices. The inverse of a partial rela- 
tionship matrix (A^-^^)"^ can be expressed by the usual 
formula 

^ (T-1)Td-1t-\ 

and the usual methods for computing the diagonal ele- 
ments of the partial relationship matrix and the inverse 
partial relationship matrix in sparse format [25,26] can be 
applied. 

The model is a trivariate model with breed A and 
B specific genetic effects for both purebred and cross- 
bred performances, and can be implemented using 
a software package for multivariate mixed models 
that either explicitly can construct inverses of par- 
tial relationship matrices from pedigree or alterna- 
tively can use inverse covariance matrices in sparse 
format as input (e.g., DMU [http : //dmu . agrsci . 
dk], WOMBAT [http://didgeridoo.une.edu. 
au/km/ wombat . php], ASReml [http : / / www . vsni . 
CO . uk/sof tware/asreml], blupf90 [http : //nee . 
ads . uga . edu/wiki/doku . php]), MiX99 [http : // 
www.mtt . f i/BGE/Sof tware/MiX99]. 



Table 3 Breed B specific partial relationship matrix A^^^ 
for the pedigree in Table 1 

Id 4 5 6 7 



4 


1 










5 


0 


1 








6 


1/2 


1/2 


1 






7 


1/4 


1/4 


1/2 


1/2 




8 


1/4 


1/4 


1/2 


1/4 


1/2 



Extending the model to incorporate genomic informa- 
tion requires the construction of two combined breed- 
specific partial relationship matrices expressed as inverse 
matrices, and for this purpose, marker-based breed- 
specific partial relationship matrices need to be con- 
structed, and marker-based and pedigree-based partial 
relationship matrices need to be made compatible. These 
are the topics of the following subsections. 

Marker-based partial relationship matrix 

Here, a marker-based breed A specific partial relation- 
ship matrix is constructed. The assumption here is that 
the marker genotypes for crossbred animals are phased 
such that it is known which allele originated from breed 
A and which allele originated from breed S. The marker 
genotype matrix m*^ for purebred A animals has elements 
= -1, 0 or 1 if SNP ; of individual / is 11, 12, or 22, 
respectively. For crossbred animals, the breed A marker 
allele matrix q-^ has elements q-^ = —0.5 or 0.5 if loci / of 
individual i has breed A allele 1 or 2, respectively. 

Constructing a marked-based breed-specific partial 
relationship matrix similar to the marker-based relation- 
ship matrix of VanRaden [14] is done by using the breed- 
specific alleles only. The marker-based breed A specific 
partial relationship matrix G^"^^ is divided into submatri- 
ces with indices denoting genotyped breed A and cross- 
bred animals, 



^A,A ^A,AB 
^AB,A ^AB,AB 



which are is defined as 



r^A) 


(m^ - (2/0^ - 


l)lT)(m^ - 


- - l)lT)T 


^A,A- 




sA 




MA) _ 


(m-^ - (2/0^ 


- Dl^Xq-^ 


_(^^_l)lT)T 


^A,AB - 




^A 




rSA) 


(q^ - Cp^ - 


- ^)l^)(q^ 


_(^^_l)lT)T 



''AB,AB 



(5) 



where the vector contains estimated breed A spe- 
cific allele frequencies based on marker genotypes for 
purebred animals and breed A specific marker alleles for 
crossbred animals, and 5"^ is a scaling parameter. The 
scaling parameter 5*^ is unspecified here since we adjust 
the marker-based partial relationship matrix to make it 
compatible with the pedigree-based partial relationship 
matrices, similar to Christensen et al. [17] (see below). 

The marker-based breed B specific partial relationship 
matrix G^^^ is constructed similarly. Matrices G^-^^ and 
q(B) correspond to two different covariance structures, 
while matrix G^-^^^ does not exist. For crossbred animals 
that are genotyped, the genetic effect is the sum of two 
effects, with variance-covariance matrices proportional to 
^^AB,AB ^AB,AB' respectively. Since a genetic effect 
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with a marker-based relationship matrix can be equiva- 
lently formulated as a sum of allele substitution effects, the 
genetic effect for crossbred animal / therefore equals 



_ (A) . (B) 
CAB,i — Cj^^ i -h Cj^^ - 



= U4 



(pj^ - l/2))af + (ql 



(pf - l/2))af 



where aj^.a^ are independent breed of origin specific 
substitution effects for SNP ; = 1 . . . The model for 
crossbred animals is therefore as described by Ibanez- 
Escriche et al. [7]. 

Compatibility of marlcer-based and pedigree-based partial 
relationship matrices 

Marker-based and pedigree-based partial relationship 
matrices must be compatible [17,27,28]. In order to 
achieve this, either the marker-based partial relationship 
matrix or the pedigree-based partial relationship matrix 
must be adjusted [28]. Here, we show how to adjust the 
breed A specific marker-based partial relationship matrix, 
G^'^\ to the breed A specific pedigree-based partial rela- 
tionship matrix for the subset of genotyped animals, A^'^^ 
similar to Christensen et al [17]. The adjustment is of the 
form 



G 



(A) 



1^0 ) A^n2i^n2) 



with submatrices corresponding to purebred genotyped 
and crossbred genotyped animals, 1 denoting a vector of 
ones (with sub-index denoting the dimension: ni is equal 
to the number of genotyped purebred animals and n2 to 
the number of genotyped crossbred animals); matrix K 
being implicitly defined; and a and P are parameters that 
need to be estimated. The form of the adjustment above is 
explained in Appendix A. According to Christensen et al. 
[17], the parameters a and can be determined by solving 
a system of two equations 



hi 



where A^^\ G^*^^ and K denote averages of all elements 

of the matrices A^'^^ G^*^^ and K, and dA^^\ respectively, 

{A) {A) 

and dAii , dG and dK denote averages of diagonal ele- 
ments of the three matrices, respectively. Based on K = 
(m + n2/2f/(ni + n2f and dK = (ni+ n2/4)/(«i + ^2), 
the resulting parameter estimates become 

^11 



11 (wi+W2)(«l+«2/4) 



GiA) _ ^^^^ (^1+^2/2)2 



a = 



{d/^^ -dG^^^P){ni+n2) 
(m + ^72/4) 



Note that parameter p is completely confounded with 
the scaling parameter 5"^ in (5), and the choice of the 
scaling parameter is therefore irrelevant. 

Combined pedigree-based and marker-based partial 
relationship matrix 

The combined partial relationship matrix (H^*^^) can be 
constructed similar to the combined relationship matrix 
for purebred animals [10-12]. On the inverse scale, the 
elements of the matrix for non-genotyped animals do not 
depend on the marker genotypes. Therefore, 



(A)^-l _ 



(G, 



(A). -I 



(A). -I 



0 



(6) 



(«l+«2)(«l+«2/4) 



with g£^^ = (1 - o))Gi^^ + (joA[^\ Parameter co is the 
fraction of genetic variance not captured by the marker 
genotypes, and in practice should be chosen to maximize 
accuracy and minimize bias of the resulting estimated 
breeding values [17]. 

Computation of the submatrix A^*^^ follows the Colleau 
algorithm [29,30], which is based on the decomposition 
j^iA) _ XDX^ shown in a previous subsection. The essen- 
tial idea is to compute the ith column of A^*^^ by comput- 
ing A^-^^e^, where is a vector with element / equal to 1 
and all other elements equal to 0, based on Misztal et al. 
[30]. The algorithm consists of computing consecutively 
r = T^e/ by solving the sparse system (T~^)^r = e/ for 
r, t = Dr, and finally A^-^^e/ = T^t by solving the sparse 
system (J-^f (A^^^ei) = r for A^-^^e/. 

In summary, computations for creating (H^-^^)"^ are 
straightforward. First, matrices (A^"^^)~^, G^"^^ and A^*^^ 
are computed, then G^*^^ is adjusted, G^^ = (1 — 
co)G^'^ + coA^^'^ is computed, and finally matrices A^"^^ 
and G^^ are inverted. The sparse inverse matrices 
(li(A)>^-i ^j^j (H^^^)~^ are used as input when imple- 
menting the extension of the Wei and van der Werf model. 



Discussion 

This paper demonstrates how to incorporate marker 
genotypes into the Wei and van der Werf model for 
genetic evaluation using both purebred and crossbred 
information. The approach builds on using partial rela- 
tionship matrices, and assumes that the marker genotypes 
of crossbreds can be phased such that the breed of origin 
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of alleles is known. Many different algorithms for phasing 
have been developed [31,32], and it has been shown that 
the accuracy of phasing depends among others on size of 
the sample and relatedness of animals within the sample. 

An alternative to using combined partial relationship 
matrices would be to specify one combined relationship 
matrix across all animals in the three breed groups A S 
and AB. As mentioned in the Methods section, this is 
actually a special case of the model where ^2^^ = 
With this approach, only one marker-based relationship 
matrix would have to be created and there would be no 
need to know the breed of origin of alleles. However, the 
adjustment of the marker-based relationship matrix to 
be compatible to the pedigree-based relationship matrix 
becomes more complicated when both breeds are con- 
sidered at the same time and, as mentioned, this model 
is less sophisticated than the model developed in this 
paper. 

More complicated crossbreeding systems with three 
breeds (mating crossbred AB animals with purebred C 
animals) or four breeds (mating breeds A and S, mat- 
ing breeds C and V, and finally mating the two groups 
of crossbred animals AB with CV) are typically used in 
pig and chicken production. The three-breed crossbreed- 
ing system was studied using pig data by Ibanez et al. [33], 
using the Garcia-Cortes and Toro [20] decomposition of 
the relationship matrix, and assuming breeding values 
for purebred and crossbred performances were identical. 
However, an extension of the Wei and van der Werf model 
to the three-breed crossbreeding system can be formu- 
lated as follows (only the vector containing genetic effects 
for terminal crossbreds is shown), 

_ (A) , (B) (C) (AB) 

<^{AB)C — ^(AB)C ^(AB)C ^(AB)C ^(AB)C' 

where the genetic terms cj^^^^^, c^^^^^ and c^^^^^ are 
related to the vectors containing breeding values for 
(AB)C crossbred performance for purebred animals, c^, 
and respectively, by partial relationships. The 
genetic term c|^^^^ is a breed-segregation term that is 
independent of the other genetic terms, and has variance- 
covariance matrix I]|^^j^I^^^^^^/2, where ^(^^^c is a 
parameter. Thus, the genetic parameter ^^j^^^c and the 
error variance parameter or^^^^^^ are not both identifi- 
able, c||^^^^ can be incorporated into the residual error, 
and the three breed crossbreeding model can be for- 
mulated using three breed-specific partial relationship 
matrices. Extending the three-breed crossbreeding model 
to include observed marker genotypes is currently been 
investigated. 

The model presented in this paper is an additive genetic 
model (in the sense that it considers and estimates substi- 
tution effects), but in practice it may capture both additive 



gene actions and partly dominant gene actions. Using 
purebred pig data, Su et al. [34] showed that when an 
additive genomic model was extended to explicitly incor- 
porate dominance genomic effects, improved accuracies 
of predictions of both total genetic values and breeding 
values were obtained. Using simulated data from cross- 
bred animals, Zeng et al. [9] showed that an increased 
response to selection was obtained with a genomic model 
with dominance genetic effects compared to an additive 
genomic model. Lo et al. [35] extended the Wei and van 
der Werf model to include dominance genetic effects and 
this model has been used in several studies on real data 
[36,37]. The formulation of that extension is based on 
extending the reduced model of Wei and van der Werf 
(see the Methods section) by incorporating a dominance 
genetic effect for the purebred phenotypes and a full- 
sib family effect for the crossbred phenotypes. Similar 
to the reduced model, this model formulation does not 
directly contain individual genetic effects for crossbred 
animals and is, therefore, not well-suited for incorporating 
genomic information on crossbred animals. A marker- 
based dominance relationship matrix was proposed by 
Su et al. [34], but this would need to be extended to 
a combined dominance relationship matrix, and further 
extended to a crossbreeding system. Extending the model 
in this paper to contain dominance genetic effects would 
be an interesting topic for future research. 

Conclusions 

A method for genomic evaluation of both purebred and 
crossbred performances was developed for a two-breed 
crossbreeding system. The method allows information 
from crossbred animals to be incorporated in a coherent 
manner for such crossbreeding systems. 

Appendix A 

In this appendix, we present the explanation behind 
the adjustment of the marker-based partial relationship 
matrix. Marker-based relationships, with allele frequen- 
cies equal to the observed ones, reflect relationships rel- 
ative to the genotyped animals, whereas pedigree-based 
relationships are relative to the base population of the 
pedigree. The idea behind the adjustment of the marker- 
based partial relationship matrix is to translate relation- 
ships to become relative to the base population of the 
pedigree, instead of being relative to the given set of 
animals, as suggested by Powell et al. [38], and which 
is also the idea behind the adjustment in Christensen 
et al. [17]. 

For a given set of animals (purebred and crossbred) 
and a given breed, let us assume that breed-specific 
gametes are randomly assigned to animals (purebred ani- 
mals receive two gametes each, and crossbred animals 
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receive one gamete each), and let a be equal to twice the 
gametic relationship coefficient. The partial relationship 
matrix for these animals, A^, has entries 





i = i' purebred. 




a 


/ 7^ i' purebreds. 


2. 


a/2 


/ purebred, crossbred. 




1/2 


/ = i' crossbreed. 


3. 


a/4 


/ ^ i' crossbreds. 





and can therefore be written as 



AF 



0 



2^«2 J 



(7) 



+ 



4 l«2(-'-«2)^ 



with submatrices corresponding to purebred and cross- 
bred animals, 1 being a vector of ones and I the identity 
matrix (with sub-indices denoting the dimension: ni equal 
to number of purebred animals and n2 number of cross- 
bred animals). The matrix 



0 

It 



would be a partial relationship matrix when gametes are 
unrelated [a = 0), and therefore the partial relationship 
matrix relative to the given set of animals. Hence, the for- 
mula (7) shows how relationships relative to the given set 
of animals are related to relationships relative to the base 
population of the pedigree. Therefore, it provides a for- 
mula to translate a marker-based relationship matrix (with 
allele frequencies being the observed ones) to have the 
same base population as the pedigree-based relationship 
matrix. As in Christensen et al. [17], we substitute P for 
1 — Of /2 to incorporate the scaling parameter s'^ in (5). 
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